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Abstract 

i In this note wc shortly rccaU the importance of last-hitting times in theory and appli- 

cations of optimal stopping. As a small contribution to this domain we then propose 
a concise proof of David Williams' decomposition of the Bessel Process of dimension 
3 (BES(3)), starting from r > at its ultimate minimum. This discussion is strongly 
motivated by our interest in properties of last hitting times in general, and here specif- 
ically, directly linked with the forthcoming reading guide of Nikeghbali and Platen on 
this subject. 
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^ 1 Introduction 

^ Only in trivial cases, last hitting times are at the same time stopping times because the 

"current- measurability" requirement is usually not satisfied. Hence it is typically harder 
to deal with last hitting times than with stopping times for which we know quite an 
impressive collection of Theorems and tools. As Chung (see citation of Nikeghbali and 
Platen(2012)) among others conclude, last hitting times must therefore be avoided at all 

. ^ costs. 

This is one way to see things, but one should admit that, often enough, reality looks 
^ somewhat different. Indeed, ironically, many interesting problems in the theory of optimal 

stopping require us to deal with last hitting times, and not with stopping times. And so, 
the attitude has changed, and the work of Jeulin (1980) and others had quite an influence 
on this development. 

In their recent paper, Nikeghbali and Platen cite several interesting examples from the 
domain of Mathematical Finance. In order to add to the motivation, we would like to 
slightly broaden the horizon and look at a few other examples. 

Examples and Motivation 

Best-choice problems, or secretary problems, arc typical representatives of a last-hitting 
time problem, namely problems of stopping on the last "improvement" of a stochastic 
process. In some of these problems, the difficulty stemming from the last-hitting time 
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character disappears. To give a very simple example, suppose we observe sequentially 
variables Xi,X2,--- and would like to maximize, for a given objective function /, the 
expected total return, that is we seek 

argmax/(Xi,X2, • • • ,Xr). 

T 

Suppose now that the optimal payoff for stopping after time t does not depend on Tt, 
where (J^s) denotes the natural filtration. Then 

supE(/(Xi,X2,--- ,Xr\Tt))=snpE{f{Xt+i,Xt+2,--- ,^r)) 

T>t T>t 

so that RHS as well as Xi,X2, ■ ■ • ,Xt arc both J^^-measurable. Hence it suffices to compare 
at each time t the value f{Xi,X2, • • • ,Xt) with the RHS supremum in order to take the 
optimal decision. 

Harder last-hitting time problems 

In more difficult problems the J-^-independence is typically no longer satisfied. How- 
ever, external information about the underlying process may help us to change nevertheless 
the last-hitting time problem into a tractable stopping problem. See for example the last- 
arrival problem (Bruss and Yor (2012)) which is a continuous time problem, and where 
the relevant external information about the underlying process is implied by a related 
martingale. There may be other examples of such an approach. 

There arc also certain other problems where the last hitting time objective is hiding 
behind other objectives, as for example the objective to discover the first time a random 
subset of a given set becomes complete. We give one specific example of this in the 
important field of clinical trials, more precisely, in so-called compassionate-use clinical 
trials. 

In such trials, a sequence of patients is treated with a drug (sometimes without FDA- 
approval) which may have serious side effects, the only justification being that it may be 
the last hope. (These trials require a special written consent of patients.) Treatments 
are typically sequential so that the physician or statistician may learn form preceding 
observations. 

A little reflection shows that these trials pose a difficult ethical problem. The consci- 
entious physician should try to save all lives which can be saved, and, at the same time 
avoid all unnecessary sufferings caused by the treatment. Since he or she is no prophet, 
the goal must be to stop (in a given sequence of patients within a fixed horizon) with 
maximum probability with the first patient completing the random subset of successes 
that is stop with the last success. Indeed, then all successes are covered, whereas the 
remaining patients (de facto not savable by the drug) do not have to suffer unnecessarily. 
If the success probability for each patient is known beforehand , then the optimal strategy 
follows immediately from the odds-algorithm (see Bruss (2000)). However, the physician 
may have almost no information about the respective success probabilities, and then the 
general solution of the optimal stopping problem is an open problem. 

These examples second the motivation of the studies of Nikeghbali and Platen, as 
well as ours in the present note. Last-hitting times are often important. Admittedly, 
no model in these papers seems tractable enough to deal for instance with the general 
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compassionate-use stopping problem described above. However, this indicates that it may 
be worth trying to view last-hitting times from any possible angle, and this is what we 
try in this paper. 

Our goal is to add to the understanding by looking at what can be done with en- 
largements of filtrations (in this case together with Girsanov's Theorem), even though we 
confine our interest to special processes. 

The BES(3)-process and Williams' Theorem 

(1.1) In their survey about last passage times, the Nikeghbali and Platen (2012) illustrate 
some of their formulae with the following example: 

Let {Rt)t>o be a BES(3)-process on M+ starting from r > 0. Denote by {J^t)t>o its 
natural filtration, and let It denote the current infimum of the process R at time t, that 
is, 

/( = inf Rs. 

s<t 

The following results can be found in Nikeghbali and Platen around Corollary 4.10: 

(a) loo follows the same distribution as the random variable rU, where U is uniform 
on [0,1]. 

(b) The Azema-supermartingale associated with the random time g at which the pro- 
cess (R) reaches loo is given by 

Zt = Pig > t\^t) = ^. 

Rt 

(c) The Laplace transform of the law of g is 



(d) The density of g denoted by p{t) equals 

p(t) = ^^fl-e-('^V2t)y 



'2^1 r 

Our aim in the remaining part of this note is to show Williams' decomposition of a 
BES(3)-process at its ultimate minimum, and how this decomposition is closely connected 
with (a)-(b)-(c)-(d). 

(1.2) Recall that if {Bt)t>o is a Brownian motion starting from and a is a real constant, 
then the law of the first hitting time of a by (Bt), denoted by Ta is given by 

G dt] = , ^ I a I cxp I I . (1) 

This well-known fact allows us to rewrite the statements (c) and (d) above as 

9 = t!:S\ (2) 

where U is independent of (B) and uniform on [0, 1], and where = denotes identity in law. 
This can be verified using (c) and (d). In fact, (2) may be understood via the classical 
decomposition of the process (R) before and after time g, due to Williams (1974). 
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2 Williams' decomposition of (R), before and after g, via 
progressive enlargement 

(2.1) Figure 1 describes well the decomposition of a BES(3)-process. 




soo 400 eoo eoo 1.000 isoo 



Figure 1: This graph presents a simulation of a Bessel process of dimension 3 based 
on three independent simulations of 1/2, l/2]-random walks 5"^, 5^, S"!, where k runs 
from 1 to 1200. The starting point is chosen = 4,Sq = 4, Sq = 2, so that the starting 
level of the simulated process is Bq = r = 6. The minimum height is indicated by the 
supporting horizontal line. In this simulation it equals 1.41963. Note that this is /1200 and 
not /oo, of course. Its level would be uniformly distributed on [0, Bq] = [0, 6] if the horizon 
were infinite. 

Note that this figure is nothing else but a (simulated finite-horizon) version of the Figure 
5 in Revuz-Yor (1999) (see Proposition 3.10 and Theorem 3.11 in Ch. 6, Sect. 3) where 

the BES(3)-proccss is considered starting from level c := r. 

(2.2) We now state precisely Williams' Decomposition Theorem before and after time 
9- 

Theorem 1 (Wilhams (1974)) 

Consider the following three independent random objects: 
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(i) a Brownian motion (Bl)t>o with B'q = r > 0; 

(ii) a uniform random variable U on [0, 1]; 

(iii) a BES(3)-process {Rt)t>o with = 0; 
Then the process (R) defined by 

R. = l^'^ - (3) 
\rU + Rt-g ,if t>g 

with g = inf{n > : B'^ = rU} is a BES(3)-process starting from r > 0. 



We note that the pre-y-Browian- motion found in (3) explains the result (2). Indeed, if 
Bl = r- Bf^ then 

5 = inf |u > : = r(l - [/)} , (2') 

which implies (2). 

2.3 Wc now proceed to the proof of the Theorem via the enlargement formula which 
describes the additive decomposition of the BES(3)-process {Rt) in the filtration (J^f )) 
containing the filtration (Jt), and making g a stopping time. 
Firstly, we have 

r. f* ds 

Rt = r + Bt+ — , (4) 

Jo -Ks 

where (B^) is a Brownian motion with respect to the filtration {J-t)- 
Secondly, the enlargement formula (see e.g. Jeulin (1980)) yields 

Jo Jg ~ 

with (Bl) being a Brownian motion with respect to (J^f). 
Thirdly, we deduce from (b) the two identities 

d<B,Z>u du 

y = -^,fovu<g (6) 

and 

d < B,l - Z >u ^ Ippdu for u > 5 (7) 

1 Zy^ Ry^{Ry^ loo) 

These two identities imply (using (4) and (6)) , and also 

1 loo 1 



+ 



Ru Ru{Ru -^oo) Ru loo 

the form of the pre-gi-process and the form of the post-^-process. 

Finally, for the proof of (3) to be complete, it remains to prove that the process {B') 

c 

is independent of the random variable I^q = rU, or more precisely, that, given /qo = a, 
the pre-(7-process is just the process (-B^)u<T' with obvious notation. This is asserted in 
the following proposition: 
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Proposition: Let ($u)u>o be a non-negative predictable process on patii-space. Fur- 
ther, let Pr denote the law of the process {R) starting from r and let denote the law of 
the Brownian motion (B') starting from r. Then, for a < r, 

Er[^g\Ioo=a]=Er[^Ta\Ta<Oo] (8.1) 

= E, [$tJ/oo < a] (8.2) 
= E;[$(S;;«<r^)] (8.3) 



Proof: The equality between the RHS of (8.1) and (8.3) follows, as we will show, from 
Doob's absolute continuity relationship, namely 



MTo 



on the canonical path-space C([0, oo],M), where (Xt) denotes the coordinate process on 
path-space. Indeed, this equality may be extended when replacing the time t by a stopping 
time. Restricting J^Ta on the set {Ta < oo} we get then in particular 

Pr/iJ'T. n {Ta < oo}) = (^) P^J^T^, < a < r, 
which yields the desired result. 

identity (8.2) is obvious, since the equality {Ta < oo} = {/qo < a} holds P^-almost surely. 
The proof of the equality (8.1) is slightly more subtle. We start with the identity 



E [l{g<ty^{I^)] =e\{1- Zt) f ^{Is)d{l - Zs) 
V Jo 

which holds for any Borel- measurable function (p : [0, oo[— > M+. To see this, note that 

E[i{g<tMit)]=E[ii-ZtMit)]. 

Assuming (p e the latter becomes by partial integration 



(9) 



E 



/* ip'{ls)dls{l - Zs)] +e\ [' ip{Is)d{l - Zs) 
Jo J Uo 



We note that the expectation involving cp' vanishes, since 1 — Zg vanishes dig almost 
everywhere. Thus a monotone class argument implies that (9) holds for every non-negative 
Borel-measurable function (p. 

Next, from the additive decomposition of (1 — Zg), we obtain 



r / dl 

E[l{,<i}<^(7oo)] =E^y^ <piQ i^-j' 

Since {It < a} = {t > Ta} the latter can also be written as 



= E 



da 

p{a) — 
Uh « . 
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so that from (9) 



/ '^(a)-^l{T„<t} 



(10). 



Now note that the identity (10) still holds if we replace t by a generic stopping time. 
Applying again the monotone class theorem gives us then 



E i^gipiloo)] = E 



/ <^(a)^$T,l{r,<oo} 
.Jo 



(11) 



Finally, using loo = fU with U being uniform on [0,1] under (see (2)), we see that 
identity (11) implies identity (8.1). 

□ 



3 Concluding remarks 

The statement of the theorem invites for a proof chosing between, on the one hand, 
initial enlargement with Iqo, and, on the other hand, progressive enlargement with g. 
However we have not exactly proceeded like this; the Proposition plays the role of the 
initial enlargement method and relies on a classical Girsanov relationship between and 
P' 

In conclusion we find it interesting to present the above as an example of the potential 
of enlargement techniques, and here specifically, of a melange of enlargement techniques 
and Girsanov's theorem. Having said so, we know that this approach can in priciple 
be done for higher dimensions; however, the corresponding Theorem 1 would look more 
complicated. 
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